Express Mail^f^el No. EL 499 467 463US 
SEQUENCE LISTING 



<110> HOSHINO, Tatsuo 
OJIMA, Kazuyuki 
SETOGUCHI, Yutaka 

<120> ASTAXANTHIN SYNTHETASE 

<130> ASTAXANTHIN SYNTHETASE 

<140> US/09/518,386 
<141> 2000-03-03 

<150> EP 99104668.1 
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<210> 1 
<211> 557 
<212> PRT 

<213> Phaffia rhodozyma 
<220> 

<221> TRANSIT 
<222> (1)..(26) 

<400> 1 

Met Phe lie Leu Val Leu Leu Thr Gly Ala Leu Gly Leu Ala Ala Phe 
15 10 15 

Ser Trp Ala Ser lie Ala Phe Phe Ser Leu Tyr Leu Ala Pro Arg Arg 
20 25 30 

Ser Ser Leu Tyr Asn Leu Gin Gly Pro Asn His Thr Asn Tyr Phe Thr 
35 40 45 
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# 



Gly Asn Phe Leu Asp lie Leu Ser Ala Arg Thr Gly Glu Glu His Ala 
50 55 60 

Lys Tyr Arg Glu Lys Tyr Gly Ser Thr Leu Arg Phe Ala Gly lie Ala 
65 - 70 75 80 

Gly Ala Pro Val Leu Asn Ser Thr Asp Pro Lys Val Phe Asn His Val 

85 90 95 

Met Lys Glu Ala Tyr Asp Tyr Pro Lys Pro Gly Met Ala Ala Arg Val 
100 105 110 

Leu Arg He Ala Thr Gly Asp Gly Val Val Thr Ala Glu Gly Glu Ala 
115 120 125 

His Lys Arg His Arg Arg He Met He Pro Ser Leu Ser Ala Gin Ala 
130 135 140 

Val Lys Ser Met Val Pro He Phe Leu Glu Lys Gly Met Glu Leu Val 
145 150 155 160 

Asp Lys Met Met Glu Asp Ala Ala Glu Lys Asp Met Ala Val Gly Glu 

165 170 175 

Ser Ala Gly Glu Lys Lys Ala Thr Arg Leu Glu Thr Glu Gly Val Asp 
180 185 190 

Val Lys Asp Trp Val Gly Arg Ala Thr Leu Asp Val Met Ala Leu Ala 
195 200 205 

Gly Phe Asp Tyr Lys Ser Asp Ser Leu Gin Asn Lys Thr Asn Glu Leu 
210 215 220 

Tyr Val Ala Phe Val Gly Leu Thr Asp Gly Phe Ala Pro Thr Leu Asp 
225 230 235 240 

Ser Phe Lys Ala He Met Trp Asp Phe Val Pro Tyr Phe Arg Thr Met 

245 250 255 

Lys Arg Arg His Glu He Pro Leu Thr Gin Gly Leu Ala Val Ser Arg 
260 265 270 



Arg Val Gly He Glu Leu Met Glu Gin Lys Lys Gin Ala Val Leu Gly 
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275 280 285 

Ser Ala Ser Asp Gin Ala Val Asp Lys Lys Asp Val Gin Gly Arg Asp 
290 295 300 

lie Leu Ser Leu Leu Val Arg Ala Asn lie Ala Ala Asn Leu Pro Glu 
305 310 315 320 

Ser Gin Lys Leu Ser Asp Glu Glu Val Leu Ala Gin lie Ser Asn Leu 

325 330 335 

Leu Phe Ala Gly Tyr Glu Thr Ser Ser Thr Val Leu Thr Trp Met Phe 
340 345 350 

His Arg Leu Ser Glu Asp Lys Ala Val Gin Asp Lys Leu Arg Glu Glu 
355 360 365 

lie Cys Gin lie Asp Thr Asp Met Pro Thr Leu Asp Glu Leu Asn Ala 
370 375 380 

Leu Pro Tyr Leu Glu Ala Phe Val Lys Glu Ser Leu Arg Leu Asp Pro 
385 390 395 400 

Pro Ser Pro Tyr Ala Asn Arg Glu Cys Leu Lys Asp Glu Asp Phe lie 

405 410 415 

Pro Leu Ala Glu Pro Val lie Gly Arg Asp Gly Ser Val lie Asn Glu 
420 425 430 

Val Arg lie Thr Lys Gly Thr Met Val Met Leu Pro Leu Phe Asn lie 
435 440 445 

Asn Arg Ser Lys Phe lie Tyr Gly Glu Asp Ala Glu Glu Phe Arg Pro 
450 455 460 

Glu Arg Trp Leu Glu Asp Val Thr Asp Ser Leu Asn Ser lie Glu Ala 
465 470 475 480 

Pro Tyr Gly His Gin Ala Ser Phe lie Ser Gly Pro Arg Ala Cys Phe 

485 490 495 

Gly Trp Arg Phe Ala Val Ala Glu Met Lys Ala Phe Leu Phe Val Thr 

500 505 510 



Page 3 




Leu Arg Arg Val Gin Phe Glu Pro 
515 520 

His lie Thr Leu lie lie Ser Arg 
530 535 

Glu Gly Tyr Gin Met Arg Leu Gin 
545 550 



He He Ser His Pro Glu Tyr Glu 

525 

Pro Arg He Val Gly Arg Glu Lys 
540 

Val Lys Pro Val Glu 
555 



<210> 2 
<211> 1932 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221> CDS 

<222> (33) . . (1706) 

<220> 

<221> polyA_site 
<222> (1871) 

<220> 

<221> mRNA 

<222> (14) . . (1891) 

<400> 2 

gaattcggca cgaggccacc tactttctcc at atg ttc ate ttg gtc ttg etc 
53 

Met Phe He Leu Val Leu Leu 
1 5 

aca ggt get tta ggc ctg get get ttc tea tgg gca tec ata gcg ttc 
101 

Thr Gly Ala Leu Gly Leu Ala Ala Phe Ser Trp Ala Ser He Ala Phe 
10 15 20 

ttc agt ctt tac etc get ccg agg cga tct tea ctg tat aac ctt cag 
149 

Phe Ser Leu Tyr Leu Ala Pro Arg Arg Ser Ser Leu Tyr Asn Leu Gin 
25 30 35 
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ggc 


ccg 


aat 




197 








Gly 


Pro 


Asn 




40 








tea 


get 


cgt 




245 








Ser 


Ala Arg 




age 


acc 


etc 




293 








Ser 


Thr 


Leu 




acc 


gat 


ccg 


SKI-. 


341 






.SKJI. 


Thr 


Asp 


Pro 








90 


?„.; !• 


ccg 


aaa 


cct 


O 


389 








Pro 


Lys 


Pro 






105 




q 


ggt 


gtt 


gtt 




437 








Gly 


Val 


Val 


nil 


120 








atg 


ate 


ccc 




485 








Met 


He 


Pro 




ttc 


tta 


gaa 




533 








Phe 


Leu 


Glu 




get 


gag 


aag 




581 








cat acc aac tac 

His Thr Asn Tyr 
45 

aca ggt gaa gag 

Thr Gly Glu Glu 
60 

egg ttt get ggg 

Arg Phe Ala Gly 
75 

aaa gtc ttc aac 
Lys Val Phe Asn 

ggt atg gee get 

Gly Met Ala Ala 
110 

acg gcg gaa ggt 

Thr Ala Glu Gly 
125 

tct ctg tec get 

Ser Leu Ser Ala 
140 

aaa ggt atg gaa 

Lys Gly Met Glu 
155 

gat atg gee gtg 



ttt aca ggc aat ttt 

Phe Thr Gly Asn Phe 
50 

cat gcg aag tac aga 

His Ala Lys Tyr Arg 
65 

ate get gga gca ccc 

He Ala Gly Ala Pro 
80 

cat gtg atg aaa gaa 

His Val Met Lys Glu 
95 

cga gtg etc aga att 

Arg Val Leu Arg He 

115 

gaa get cat aag cga 

Glu Ala His Lys Arg 
130 

cag gee gtt aag teg 

Gin Ala Val Lys Ser 
145 

ctt gtc gac aag atg 

Leu Val Asp Lys Met 
160 

gga gag teg gee ggt 



tta gac ate etc 

Leu Asp He Leu 
55 

gaa aaa tac gga 

Glu Lys Tyr Gly 
70 

gtc ttg aac teg 

Val Leu Asn Ser 
85 

gee tac gac tat 

Ala Tyr Asp Tyr 
100 

get acc gga gat 
Ala Thr Gly Asp 

cat cga agg ate 

His Arg Arg He 
135 

atg gtc cca att 

Met Val Pro He 
150 

atg gag gat gcg 

Met Glu Asp Ala 
165 

gaa aag aag gca 
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Ala Glu Lys Asp Met Ala Val Gly Glu Ser Ala Gly Glu Lys Lys Ala 
170 175 180 

acc aga etc gag ace gaa gga gtc gat gta aag gat tgg gtc ggt cga 
629 

Thr Arg Leu Glu Thr Glu Gly Val Asp Val Lys Asp Trp Val Gly Arg 
185 190 195 





get 


act ctg gac gtc 


atg get ctt gca 


gga ttt gac 


tat 


aag 


age 


gac 




Oft 


















Ala 


Thr Leu Asp Val Met Ala Leu Ala Gly Phe Asp Tyr Lys 


Ser Asp 




200 




205 


210 








215 




teg 


etc cag aac aag 


acc aat gag etc 


tat gtc get 


ttt 


gtc 


gga 


ctt 




/ ^» -J 


















Ser 


Leu Gin Asn Lys 


Thr Asn Glu Leu 


Tyr Val Ala 


Phe 


Val 


Gly Leu 






220 




225 






o "3 n 




w 


acc 


gat ggg ttt get 


cct acc ttg gac 


teg ttc aag 


get 


ate 


atg 


tgg 


m 


773 
















CP; 


Thr 


Asp Gly Phe Ala 


Pro Thr Leu Asp 


Ser Phe Lys 


Ala 


He 


Met 


Trp 


rp 




235 


240 






Z ^± D 






H 


gat 


ttt gta cct tac 


ttc cga act atg 


aaa egg aga 


cat 


gag 


ata 


cct 




821 
















m 


Asp 


Phe Val Pro Tyr 


Phe Arg Thr Met 


Lys Arg Arg 


His 


Glu 


He 


Pro 






250 


255 




260 








H ■: 


ttg 


act caa gga tta 


gca gtt tec cga 


cga gtt ggg 


ate 


gag 


ctt 


atg 


SB ij 


869 


















Leu 


Thr Gin Gly Leu Ala Val Ser Arg Arg Val Gly 


He 


Glu 


Leu 


Met 






265 


270 


275 












gag 


caa aag aag cag 


gee gtg ctt ggc 


tea get tec 


gat 


cag 


get 


gtt 




917 


















Glu 


Gin Lys Lys Gin Ala Val Leu Gly Ser Ala Ser Asp 


Gin 


Ala 


Val 




280 




285 


290 








295 




gat 


aaa aag gat gtt 


caa ggt egg gat 


ate eta agt 


etc 


eta 


gtg 


aga 




965 


















Asp 


Lys Lys Asp Val Gin Gly Arg Asp 


lie Leu Ser 


Leu 


Leu 


Val 


Arg 






300 




305 






310 
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gca aac ate gee 
1013 

Ala Asn lie Ala 
315 

gag gta etc get 
1061 

Glu Val Leu Ala 
330 

tct teg aca gtc 
1109 

Ser Ser Thr Val 
345 

gec gtt cag gat 
1157 

Ala Val Gin Asp 
360 

atg cct acg eta 
1205 

Met Pro Thr Leu 



gtt aag gag tct 
1253 

Val Lys Glu Ser 
395 

gaa tgc tta aag 
1301 

Glu Cys Leu Lys 
410 

ggt cga gat ggg 
1349 

Gly Arg Asp Gly 
425 

atg gtc atg ctt 
1397 

Met Val Met Leu 




gee aac ctg cct 
Ala Asn Leu Pro 

cag ate agt aac 

Gin lie Ser Asn 
335 

ttg aca tgg atg 

Leu Thr Trp Met 
350 

aaa ctt cga gaa 

Lys Leu Arg Glu 
365 

gac gaa ctt aat 

Asp Glu Leu Asn 
380 

ctt cgt eta gac 
Leu Arg Leu Asp 

gat gaa gac ttc 

Asp Glu Asp Phe 
415 

teg gtc ate aac 

Ser Val lie Asn 
430 

ccg ttg ttc aac 
Pro Leu Phe Asn 



gaa tct caa aag 

Glu Ser Gin Lys 
320 

ctg tta ttt get 
Leu Leu Phe Ala 

ttt cac cga etc 

Phe His Arg Leu 
355 

gaa att tgt cag 

Glu He Cys Gin 
370 

gcg ttg cct tat 

Ala Leu Pro Tyr 
385 

cct cct agt ccg 

Pro Pro Ser Pro 
400 

ate cca ctt gee 

He Pro Leu Ala 

gag gtc egg ate 

Glu Val Arg He 
435 

ate aat cgt tea 
He Asn Arg Ser 



ctg tec gat gag 

Leu Ser Asp Glu 
325 

gga tat gaa act 

Gly Tyr Glu Thr 
340 

tea gaa gac aaa 
Ser Glu Asp Lys 

ate gac acg gat 

He Asp Thr Asp 
375 

etc gaa gcg ttt 

Leu Glu Ala Phe 
390 

tat get aac cgt 

Tyr Ala Asn Arg 
405 

gag cct gtc att 

Glu Pro Val He 
420 

acg aaa gga acg 

Thr Lys Gly Thr 

aag ttc att tat 
Lys Phe He Tyr 
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440 445 450 455 

gga gaa gat gca gaa gaa ttc aga ccg gag agg tgg ctt gag gac gta 
1445 

Gly Glu Asp Ala Glu Glu Phe Arg Pro Glu Arg Trp Leu Glu Asp Val 

460 465 470 

aca gac teg etc aac agt att gaa gca ccc tat gga cac cag gcg age 
1493 

Thr Asp Ser Leu Asn Ser lie Glu Ala Pro Tyr Gly His Gin Ala Ser 

475 480 485 

ttt ate tct gga ccc aga get tgc ttt ggt tgg cga ttt get gtc gec 
1541 

Phe lie Ser Gly Pro Arg Ala Cys Phe Gly Trp Arg Phe Ala Val Ala 
490 495 500 

gag atg aag gec ttc ttg ttt gtc act etc cgt egg gtc cag ttc gag 
1589 

Glu Met Lys Ala Phe Leu Phe Val Thr Leu Arg Arg Val Gin Phe Glu 
505 510 515 

ccc ate ate tct cat cca gag tac gag cac ate acc ttg ate att tec 
1637 

Pro lie lie Ser His Pro Glu Tyr Glu His lie Thr Leu lie lie Ser 
520 525 530 535 

cgt cct cga ate gtt ggt " aga gag aag gag ggg tac cag atg cgt ttg 
1685 

Arg Pro Arg lie Val Gly Arg Glu Lys Glu Gly Tyr Gin Met Arg Leu 

540 545 550 

cag gtc aag ccg gtc gaa tga gttgattctt catatgttaa gagaagttct 
1736 

Gin Val Lys Pro Val Glu 
555 

atatctgaga atgtgtgact aggacaatgc cttctttgta tcgatttgtt tctcataccc 
1796 

gggcaggege tatgacttct aegtegtcta tcgtcgctct ggactctctt cttaccctat 
1856 
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atattattcc atccgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagcggc cgctcgagcc 
1916 

ggctcgtgcc gaattc 
1932 



<210> 3 
<211> 557 
<212> PRT 

<213> Phaffia rhodozyma 



<400> 3 





Met 


Phe 


He 


Leu 


Val 


Leu 


Leu 


Thr 


Gly Ala 


Leu 


Gly 


Leu 


Ala 


Ala 


Phe 




1 








5 










10 










15 






Ser 


Trp 


Ala 


Ser 


He 


Ala 


Phe 


Phe 


Ser 


Leu 


Tyr 


Leu 


Ala 


Pro 


Arg 


Arg 










20 










25 










30 






Ser 


Ser 


Leu 
35 


Tyr 


Asn 


Leu 


Gin 


Gly 
40 


Pro 


Asn 


His 


Thr 


Asn 
45 


Tyr 


Phe 


Thr 


•H : !: 

''3;.r ;! 

m 

-z; - 


Gly 


Asn 
50 


Phe 


Leu 


Asp 


He 


Leu 
55 


Ser 


Ala 


Arg 


Thr 


Gly 
60 


Glu 


Glu 


His 


Ala 




Lys 


Tyr 


Arg 


Glu 


Lys 


Tyr 


Gly 


Ser 


Thr 


Leu 


Arg 


Phe 


Ala 


Gly 


He 


Ala 




65 










70 










75 










80 


%il 


Gly 


Ala 


Pro 


Val 


Leu 


Asn 


Ser 


Thr 


Asp 


Pro 


Lys 


Val 


Phe 


Asn 


His 


Val 


3 










85 










90 










95 






Met 


Lys 


Glu 


Ala 


Tyr 


Asp 


Tyr 


Pro 


Lys 


Pro 


Gly 


Met 


Ala 


Ala 


Arg 


Val 


= ~r 

d= 

J** 








100 










105 










110 






Leu 


Arg 


He 


Ala 


Thr 


Gly 


Asp 


Gly 


Val 


Val 


Thr 


Ala 


Glu 


Gly 


Glu 


Ala 


■sss:. 






115 










120 










125 








?~ j: 


His 


Lys 
13 0 


Arg 


His 


Arg 


Arg 


He 
135 


Met 


He 


Pro 


Ser 


Leu 
140 


Ser 


Ala 


Gin 


Ala 




Val 


Lys 


Ser 


Met 


Val 


Pro 


He 


Phe 


Leu 


Glu 


Lys 


Gly 


Met 


Glu 


Leu 


Val 




145 










150 










155 










160 




Asp 


Lys 


Met 


Met 


Glu 
165 


Asp 


Ala 


Ala 


Glu 


Lys 
170 


Asp 


Met 


Ala 


Val 


Gly 
175 


Glu 




Ser 


Ala 


Gly 


Glu 
180 


Lys 


Lys 


Ala 


Thr 


Arg 
185 


Leu 


Glu 


Thr 


Glu 


Gly 
190 


Val 


Asp 




Val 


Lys 


Asp 
195 


Trp 


Val 


Gly 


Arg 


Ala 
200 


Thr 


Leu 


Asp 


Val 


Met 
205 


Ala 


Leu 


Ala 




Gly 


Phe 
210 


Asp 


Tyr 


Lys 


Ser 


Asp 
215 


Ser 


Leu 


Gin 


Asn 


Lys 
220 


Thr 


Asn 


Glu 


Leu 




Tyr 


Val 


Ala 


Phe 


Val 


Gly 


Leu 


Thr 


Asp Gly 


Phe 


Ala 


Pro 


Thr 


Leu 


Asp 




225 










230 










235 










240 
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Ser Phe Lys Ala lie Met Trp Asp Phe Val Pro Tyr Phe Arg Thr Met 

245 250 255 

Lys Arg Arg His Glu lie Pro Leu Thr Gin Gly Leu Ala Val Ser Arg 

260 265 270 

Arg Val Gly lie Glu Leu Met Glu Gin Lys Lys Gin Ala Val Leu Gly 

275 280 285 

Ser Ala Ser Asp Gin Ala Val Asp Lys Lys Asp Val Gin Gly Arg Asp 

290 295 300 

lie Leu Ser Leu Leu Val Arg Ala Asn lie Ala Ala Asn Leu Pro Glu 
305 310 315 320 

Ser Gin Lys Leu Ser Asp Glu Glu Val Leu Ala Gin lie Ser Asn Leu 

325 330 335 

Leu Phe Ala Gly Tyr Glu Thr Ser Ser Thr Val Leu Thr Trp Met Phe 

340 345 350 

His Arg Leu Ser Glu Asp Lys Ala Val Gin Asp Lys Leu Arg Glu Glu 

355 360 365 

lie Cys Gin lie Asp Thr Asp Met Pro Thr Leu Asp Glu Leu Asn Ala 

370 375 380 

Leu Pro Tyr Leu Glu Ala Phe Val Lys Glu Ser Leu Arg Leu Asp Pro 
385 390 395 400 

Pro Ser Pro Tyr Ala Asn Arg Glu Cys Leu Lys Asp Glu Asp Phe lie 

405 410 415 

Pro Leu Ala Glu Pro Val lie Gly Arg Asp Gly Ser Val lie Asn Glu 

420 425 430 

Val Arg lie Thr Lys Gly Thr Met Val Met Leu Pro Leu Phe Asn lie 

435 440 445 

Asn Arg Ser Lys Phe lie Tyr Gly Glu Asp Ala Glu Glu Phe Arg Pro 

450 455 460 

Glu Arg Trp Leu Glu Asp Val Thr Asp Ser Leu Asn Ser lie Glu Ala 
465 470 475 480 

Pro Tyr Gly His Gin Ala Ser Phe lie Ser Gly Pro Arg Ala Cys Phe 

485 490 495 

Gly Trp Arg Phe Ala Val Ala Glu Met Lys Ala Phe Leu Phe Val Thr 

500 505 510 

Leu Arg Arg Val Gin Phe Glu Pro He He Ser His Pro Glu Tyr Glu 

515 520 525 

His He Thr Leu He He Ser Arg Pro Arg He Val Gly Arg Glu Lys 

530 535 540 

Glu Gly Tyr Gin Met Arg Leu Gin Val Lys Pro Val Glu 
545 550 555 
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<210> 4 
<211> 3969 
<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> 5'UTR 

<222> (517) . . (518) 

<220> 

<221> intron 
<222> (784) . . (898) 

<220> 

<221> intron 

<222> (1016) . . (1087) 

<220> 

<221> intron 

<222> (1180) . . (1302) 

<220> 

<221> intron 

<222> (1518) .. (1600) 

<220> 

<221> intron 

<222> (1635) . . (1723) 

<220> 

<221> intron 

<222> (1867) . . (1939) 

<220> 

<221> intron 

<222> (2000) . . (2081) 

<220> 

<221> intron 

<222> (2182) . . (2257) 

<220> 

<221> intron 



<222> 



(2355) . . (2431) 



<220> 



<221> intron 

<222> (2543) . . (2618) 

<220> 

<221> intron 

<222> (2653) . . (2742) 

<220> 

<221> intron 

<222> (2815) . . (2962) 

<220> 

<221> intron 

<222> (3051) . . (3113) 

<220> 

<221> intron 

<222> (3172) . . (3247) 

<220> 

<221> intron 

<222> (3322). . (3398) 

<220> 

<221> intron 

<222> (3424) . . (3513) 

<220> 

<221> polyA_site 
<222> (3865) (3866) 

<220> 

<221> intron 
<222> (653) . . (734) 

<400> 4 

cggaccgaag cctcgccagc agttgatcaa gcgaaccaag ccgaacaatc ctggcgcgcc 
60 

tggaggagcg ggagcgggag gagcagcagg tgatgcatcg ggtggacaga atcagtagtg 
120 
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tgtgtgtatg tgtgtagtgt agttgggttg 
180 

tctttaaaat ctctacattg aatgtttacc 
240 

gttgtgtaac cagttcgatt gagattacga 
300 

atctagagcg acatctggct cgatcggtcc 
360 

gaggaacgcc gaggccgaac gagacagaca 
420 

gatcgttttt acatactacc cgtcgattct 
480 

gcattctcta tctcgtgtaa catcgatccg 
540 

atcttggtct tgctcacagg tgctttaggc 
600 

ttcttcagtc tttacctcgc tccgaggcga 
660 

tgagctctgg aatcatgctt gtgtaaatcc 
720 

catcctctct tcaggcccga atcataccaa 
780 

ctcgtgagtt ttcatcattg gctcagtcgt 
840 

tcggacgcgt tcttctttct atgtgaaatc 
900 

ctcgtacagg tgaagagcat gcgaagtaca 
960 

ctgggatcgc tggagcaccc gtcttgaact 
1020 

tccatccgaa ccctcatcct cctctgctga 
108 0 

tggacagtgt gatgaaagaa gcctacgact 
1140 

tcagaattgc taccggagat ggtgttgtta 
1200 

atcacatcta atccactcgg cgcgattgaa 
1260 

ttttcacttc atggtctcgg tgcatcttgt 
1320 

atcgaaggat catgatcccc tctctgtccg 
1380 

tcttagaaaa aggtatggaa cttgtcgaca 



tcccatgtgc ttcttcttat catcatcatt 
ggaacgggct ttgatgatac tacggaccac 
ttagatagcc gatccgtcga tcagatctcg 
ttgccgaaaa tcagggcacc gatcagggca 
caccatcatc atcagccatg tcttttttgt 
aaccttcttt cttcttctct tgccatcttt 
attcttgcca cctactttct ccatatgttc 
ctggctgctt tctcatgggc atccatagcg 
tcttcactgt ataaccttca gggtaagaat 
tataatctca ttcatcctat tcctcttctt 
ctactttaca ggcaattttt tagacatcct 
ccaatcttaa cgatcatcgc taacgacctt 
tgatctttgg tttgttacga gagcacagag 
gagaaaaata cggaagcacc ctccggtttg 
cgaccgatcc gaaagtcttc aaccagtttg 
tcaattcaac tgtagttaac gcactttgaa 
atccgaaacc tggtatggcc gctcgagtgc 
cggcggaagg tgcttttcaa gttctcttat 
ctcaacattt ctgacgagcc tgtcaccttg 
ctcatctcat aggtgaagct cataagcgac 
ctcaggccgt taagtcgatg gtcccaattt 
agatgatgga ggatgcggct gagaaggata 
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1440 

tggccgtggg agagtcggcc ggtgaaaaga 
1500 

atgtaaagga ttgggtcgtg agtacccgcc 
1560 

caaggaaagg ttcattgact gacaaacact 
1620 

catggctctt gcaggtcagt ctactctctc 
1680 

ctgacatgct cttcctatat tcgatacgac 
1740 

gactcgctcc agaacaagac caatgagctc 
1800 

tttgctccta ccttggactc gttcaaggct 
1860 

actatggtat gtctgccatt ctttgatatc 
1920 

tttcacctat cgtgaacaga aacggagaca 
1980 

ttcccgacga gttgggatcg taagtgccag 
2040 

atcttaacct cctaggctca ttcatccatg 
2100 

agcaggccgt gcttggctca gcttccgatc 
2160 

gggatatcct aagtctccta ggttagtaac 
2220 

ttctttccct gacaactgtc aacatgctcg 
2280 

ctgcctgaat ctcaaaagct gtccgatgag 
2340 

tttgctggat atgagtgtgt atcctttccc 
2400 

atagaggtct ttatgtttcc tgtttgatca 
2460 

ttcaccgact ctcagaagac aaagccgttc 
2520 

tcgacacgga tatgcctacg ctgtgaggat 
2580 

aatgactaaa acggccttcc attcttgatc 
2640 

tatctcgaag cggttggttc tcgattcttg 
2700 



aggcaaccag actcgagacc gaaggagtcg 
tattccttca ccttgatgga cgaagcatat 
atcttaccag ggtcgagcta ctctggacgt 
ttataaatgc tccacatatg tatgcatgta 
gtcatatgtc caggatttga ctataagagc 
tatgtcgctt ttgtcggact taccgatggg 
atcatgtggg attttgtacc ttacttccga 
caaagattat ggataggtta cttgctaaaa 
tgagatacct ttgactcaag gattagcagt 
atcaagcctc tctgaatatt cttggtcatc 
gtgcgcaata ggagcttatg gagcaaaaga 
aggctgttga taaaaaggat gttcaaggtc 
gtttttaaac gtatatacag agcggcgaca 
ttactagtga gagcaaacat cgccgccaac 
gaggtactcg ctcagatcag taacctgtta 
ctctctatcc ttagctgatt aaaagcacta 
gaacttcttc gacagtcttg acatggatgt 
aggataaact tcgagaagaa atttgtcaga 
gtttttgatg ctaaattact tcttcttgca 
cattttagag acgaacttaa tgcgttgcct 
gtcttgtctt ccaaatacaa tacggattat 
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tgctcatctg atttgcgtct acgggctgtg 
2760 

cgtctagacc ctcctagtcc gtatgctaac 
2820 

tggcttcatc acgcataatt ttcatttcat 
2880 

cgagctcaaa ttccggcttc ctcttctgtg 
2940 

cttcaaccaa aatttgtcac agttcatccc 
3000 

gtcggtcatc aacgaggtcc ggatcacgaa 
3060 

ctttatttca tctcgtccat gaaatagttt 
312 0 

caacatcaat cgttcaaagt tcatttatgg 
3180 

gttttctttt aaaagccaat cggtttcgta 
3240 

tcgaaagacc ggagaggtgg cttgaggacg 
3300 

cctatggaca ccaggcgagc tgtatgtttt 
3360 

cgttgaactt cgcgcttccc ttgttgttga 
3420 

cttgtaagtt tcttctcatc tggcgcctta 
3480 

tacgattgtt tctgactctc tcgactttcg 
3540 

gaaggccttc ttgtttgtca ctctccgtcg 
3600 

agagtacgag cacatcacct tgatcatttc 
3660 

ggggtaccag atgcgtttgc aggtcaagcc 
3720 

gagaagttct atatctgaga atgtgtgact 
3780 

tctcataccc gggcaggcgc tatgacttct 
3840 

cttaccctat atattattcc atccgtctgt 
3900 

aactcaatat tcagcctctt catgcttctg 
3960 

cgactgcag 



gaatttaact agtttgttaa ggagtctctt 
cgtgaatgct taaaggatga agacgtatgt 
attcctttgt acatacgcat acaggctgac 
cttctttttc tggcctttct tatcttcatt 
acttgccgag cctgtcattg gtcgagatgg 
aggaacgatg gtcatgcttc gtaagttttc 
ctgatagacg cggaccaatt cagcgttgtt 
agaagatgca gaagaattca ggtacaattc 
tcgtaattga ccgggctctc ttttaatttc 
taacagactc gctcaacagt attgaagcac 
attgatttta tctttgtgaa ttttgcaaaa 
aatcccagtt atctctggac ccagagcttg 
gcagtatccg atcagccatc tagttctttg 
cagtggttgg cgatttgctg tcgccgagat 
ggtccagttc gagcccatca tctctcatcc 
ccgtcctcga atcgttggta gagagaagga 
ggtcgaatga gttgattctt catatgttaa 
aggacaatgc cttctttgta tcgatttgtt 
acgtcgtcta tcgtcgctct ggactctctt 
atatttgtct atcacgacgt ctgtgtcgtc 
tgtctccata gatgtgatct tcatgtttgt 



Page 15 



3969 



<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for expression of the AST gene in E. coli 

<400> 5 

gttcaaagtt catttatgga 
20 



<210> 6 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 

primer for expession of the AST gene in E. coli 

<400> 6 

ggatcctcag tggtggtggt ggtggtgttc gaccggcttg acctgca 
47 



<210> 7 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 5' sense 

primer for expression of a modified AST gene in E. 
coli 

<400> 7 

catatgcacc accaccacca ccacctgtat aaccttcagg ggccc 
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45 



<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 5' antisense 

primer for expression of a modified AST gene in E. 
coli 

<400> 8 

gtaacaacac catctccggt 
20 



<210> 9 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 3' anti sense 
primer for expression of a modified AST gene in E. 
coli 

<400> 9 

ggatcctcaa ctcattcgac cggctt 
26 



<210> 10 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Genome walking 
primer for cloning of the AST gene 

<400> 10 
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tagagagaag gaggggtacc agatgc 
26 



<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 

primer for cloning of the terminator region of the 
AST gene 

<400> 11 

ccccggattg tggagaaact 
20 . 



<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for cloning the genomic AST gene 

<400> 12 

atgttcatct tggtcttgct 
20 



<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 
primer for cloning the genomic AST gene 

<400> 13 
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acgtagaagt catagcgcct 
20 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for RT-PCR of the AST gene 

<400> 14 

tttgactcaa ggattagcag 
20 



<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 
primer for RT-PCR of the AST gene 

<400> 15 

tgtcttctga gagtcggtga 
20 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
sense primer for cloning of the TPI gene 

<220> 

<221> misc feature 
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<222> 3 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 6 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 15 

<223> n is a or c or g or t 
<220> 

<221> misc_f eature 
<222> 18 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 21 

<223> n is a or c or g or t 
<400> 16 

mgnacnttyt tygtnggngg naay 
24 



<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
antisense primer for cloning the TPI gene 

<220> 

<221> misc_feature 
<222> 3 

<223> n is a or c or g or t 
<220> 
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• # 

<221> misc_f eature 
<222> 6 

<223> n is a or c or g or t 
<220> 

<221> misc_f eature 
<222> 9 

<223> n is a or c or g or t 
<220> 

<221> misc_f eature 
<222> 12 

<223> n is a or c or g or t 



misc_feature 
18 

n is a or c or g or t 

W 
Hi 

misc__feature 
24 

n is a or c or g or t 
<400> 17 

™!| gcnccnccna cnarraancc rtcnacrtc 

A 29 

ry <2io> is 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primary- 
walking primer for cloning of the TPI terminator 

<400> 18 

gcttacctcg cttccaacgt ttcccag 
27 



<220> 
<221> 
<222> 
<223> 



m 
m 



<220> 
<221> 
<222> 
<223> 
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<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nested walking 
primer for cloning of the TPI terminator 

<400> 19 

ggatctgtct ctgcctccaa ctgcaag 
27 



<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primary 

walking primer for cloning of the TPI promoter 

<400> 20 

gggtcaatgt cggcagcgag aagccca 
27 



<210> 21 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nested walking 
primer for cloning of the TPI promoter 

<400> 21 

atgtactcgg tagcactgat caagtag 
27 



<210> 22 
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• 



<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for construction of the TPI promoter cassette 

<400> 22 

gcggccgcat ccgtctcgcc atcagtct 
28 



<210> 23 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 
primer for construction of the TPI promoter 
cassette 

<400> 23 

cctgcaggtc tagagatgaa taaatataaa gagt 
34 



<210> 24 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for construction of the TPI terminator cassette 

<400> 24 

cctgcaggta aatatatcca gggattaacc ccta 
34 



<210> 25 
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• # 

<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 
primer for construction of the TPI terminator 
cassette 

<400> 25 

ggtacccgtg cgcagtcgac cgagacat 
28 



<210> 26 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
sense primer for cloning of the AMY gene 

<220> 

<221> misc_feature 
<222> 15 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 21 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 27 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 30 

<223> n is a or c or g or t 
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<400> 26 

gaytayathc arggnatggg nttyrmngcn athtg 
35 



<210> 27 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
antisense primer for cloning of the AMY gene 

<220> 

<221> misc_feature 
<222> 6 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 9 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 24 

<223> n is a or c or g or t 
<220> 

<221> misc_feature 
<222> 30 

<223> n is a or c or g or t 
<400> 27 

tgytcngtnc crtartadat datnggdatn ccrtc 
35 



<210> 28 
<211> 26 ■ 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Sense primer 
for construction of a partial AMY cassette 

<400> 28 

ccgcggcatt gatacctcta ccccgt 
26 



<210> 29 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antisense 

primer for construction of a partial AMY cassette 

<400> 29 

gcggccgcct gcaatcctgg atccaccg 
28 



<210> 30 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for construction of the AST cassette 

<400> 30 

tctagaatgt tcatcttggt cttgctca 
28 



<210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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# # 

<220> 

<223> Description of Artificial Sequence: Antisense 
primer for construction of the AST cassette 

<400> 31 

cctgcaggtc attcgaccgg cttgacct 
28 



<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sense primer 
for confirmation of integration at the AMY locus 
by PCR analysis 

<400> 32 

ctctcctgtt cacaaaaaca 
20 
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